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Abstract 

Let M be a smooth compact oriented Riemannian manifold, and let Am be the Laplace-Beltrami 
operator on M. Say 0^/6 5(R + ), and that /(0) = 0. For t > 0, let K t {x,y) denote the kernel 
of /(£ 2 Am)- We show that K t is well-localized near the diagonal, in the sense that it satisfies es- 
timates akin to those satisfied by the kernel of the convolution operator f(t 2 A) on R". We define 
continuous 5-wavelets on M, in such a manner that Kt(x,y) satisfies this definition, because of its 
localization near the diagonal. Continuous <S-wavelets on M are analogous to continuous wavelets on 
R n in 5(R n ). In particular, we are able to characterize the Holder continuous functions on M by the 
size of their continuous S— wavelet transforms, for Holder exponents strictly between and 1. If M 
is the torus T 2 or the sphere S 2 , and f(s) = se~ s (the "Mexican hat" situation), we obtain two ex- 
plicit approximate formulas for Kt, one to be used when t is large, and one to be used when t is small. 
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1 Introduction 

Let S(R + ) denote the space of restrictions to R + of functions in <S(R). Say ^ /o G 5(R + ), and let 

/(«) = 8f Q (a). 

One then has the Calderon formula for /: if c € (0, oo) is defined by 

dt f°° 
\f(t)\ 2 - = / t\f (t)\ 2 dt, 



then for all s > 0, 



t 



\f(ta)\*j = c<ao. (1) 
The even function h(£) = /(£ 2 ) is then in <S(R) and satisfies 

\W)\ 2 J = |<°°- (2) 
(In fact, all even functions in S(R) satisfying ([2| arise in this manner). 

Its inverse Fourier transform xj) = h is admissible (i.e. is a continuous wavelet). That is, for some d > 0, 

\F*Mlj = c'\\F\\l (3) 

for all FeL 2 (M"). Here, as usual, xp t (x) = t~ x i{){x/t). 
We prefer to write, formally, 

h = f(-d 2 /dx 2 )6; 

the formal justfication being that 

(f(-d 2 /dx 2 )sy = f(e) = h(0. 

Thus f(—d 2 /dx 2 )S is a continuous wavelet on R. 

Our program is to construct analogues of continuous wavelets, on much more general spaces, by replacing 
the positive number s in (JT|) by a positive self-adjoint operator T on a Hilbert space TL. If P is the 
projection onto the null space of T, by the spectral theorem we obtain the relation 



J™\f\ 2 {tT) d j=c{I-P\ 



(4) 



where the integral in Q converges strongly. Q will be justified in the next section. 

Taking T to be —d 2 /dx 2 on R leads to the continuous wavelet f(—d 2 /dx 2 )5 on R. (Of course, on R, 

P = 0.) 

We began our program of looking at more general positive self-adjoint operators T, in order to construct 
continuous wavelets, in our article [17]. There we took T to be the sublaplacian L on L 2 (G), where G is 
a stratifed Lie group, and thereby obtained continuous wavelets and frames on such G. In fact, in that 
context, f(L)5 was a continuous wavelet. (Again, in that context, P was zero.) (Our article |17j was 
motivated by the second author's thesis [35], where it was shown that if f(s) = se~ s , then the "Mexican 
hat" f(L)8 is a continuous wavelet on the Heisenberg group.) 
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In this article we will look at the (much more practical!) situation in which T is the Laplace-Beltrami 
operator on L 2 (M), where M is a smooth compact oriented Ricmannian manifold without boundary. 
We will construct analogues of continuous Schwartz wavelets in this context, and will obtain explicit 
formulas for them if M is the sphere or the torus. In a sequel article ([18]) we shall use a discrete 
analogue of Q to construct nearly tight frames in this situation, and show that they are appropriately 
well- localized (specifically, that they satisfy a space-frequency analysis). In both of these articles, P will 
be the projection onto the one-dimensional space of constant functions. 

We now summarize our results and methods. We discuss prior work on wavelets on manifolds, especially 
the work of Narcowich, Petrushev and Ward on the sphere [35], at the end of this introduction. 

In the model situation (JTJ - ^ on K, note that the kernel of the operator F — ► F * ip t is K t (x,y) :— 
t~ lr ip((x — y)/t). Since h(Q) = 0, we have J K t (x,y)dx = for all y and J K t (x,y)dy — for all x. Also 
K t {x, y) is smooth in t, x, y for t > 0. 

Motivated by this model case, we define continuous wavelets on M as follows. Suppose that the function 
K t (x, y) is smooth for t > 0, x, y £ M. For t > 0, define T t : L 2 (M) -> C°°(M) to be the operator with 
kernel K t , We define K t (x,y) to be a continuous wavelet on M, provided that for some c > 0, 



for all F 6 L 2 (M.), and that for any t > 0, J- M Kt(x,y)dfi(x) = for all y and J m K t (x,y)dfi(y) = for 
all x. (Here y, is the measure on M arising from integration with respect to the volume form.) 
It is then easy to see that, if K t is the kernel of f{t 2 A) (/ as before), where A = Am is the Laplace- 
Beltrami operator on M, then K t {x,y) is a continuous wavelet on M. In particular, ([5| follows easily 
from Q, simply by applying both sides of Q to F, taking the inner product of both sides with F, and 
making the change of variables which replaces t by i 2 in the integral. Note that K t (x, y) = [f(t 2 A)S y ](x), 
which is analogous to f(—d 2 /dx 2 )S being a continuous wavelet on R. 

This then gives an L 2 theory of continuous wavelets on manifolds. However, in practice, one wishes to 
go beyond this theory, looking at continuous wavelets on other function spaces, and discrctizing them to 
obtain frames. For this, one needs the wavelets to have further properties. Fortunately, as we shall see, 
if K t is the kernel of f(t 2 A), these properties are present. 

Specifically, let us return to our model situation; let us however work on K n , not just M. 1 . Then K t (x, y), 
the kernel of f(t 2 A), would be of the form t~ n ip((x — y)/t) for some ip £ S. (Here A is the usual Laplacian 
on E™, and ip = G, where G(£) = /(|£| 2 )-) For any N,a,(3, there would thus exist Cjy a a such that 



for all t,x,y. Such estimates are essential in the theory of wavelets on M. n . We therefore make the 
following definition: 

Definition 1.1. Let K t (x,y) be a continuous wavelet on M. Then we say that K t (x,y) is a continuous 
5-wavelet on M provided that: 

For every pair of C°° differential operators X (in x) and Y (in y) on M, and for every integer N > 0, 
there exists Cn,x,y as follows. Suppose degX = j and degy = k. Then 




(5) 





(6) 
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for all t > and all x, y £ M. 

Here d is the geodesic distance on M. We use the terminology "continuous 5-wavelet" to express the 
idea that these wavelets are analogous to Schwartz wavelets on R™. 



In Lemma 4.1 we will show the following key result: 



Lemma 4.1 If K t is the kernel of /(( A) (/ as before), then K t (x,y) satisfies (I6j) (and hence is a 



continuous 5-wavelet on M). 



Our proof of this lemma uses the theory of pseudodiffcrcntial operators (most crucially, a result of 
Strichartz [43 ), and Huygens' principle. 

In Theorem |5.5| we shall show that continuous S- wavelets are well adapted to the study of certain other 
function spaces. Specifically, we shall show the following generalization of a theorem of Holschneider and 
Tchamitchian ([25), who worked on the real line, and characterized the wavelet transforms of the spaces 
of Holder continuous functions (for Holder exponent strictly between and 1). 

Theorem 15.51 

Let K t {x,y) be a continuous S-wavelet on M, and, for t > 0, let T t be the operator on L 2 with kernel 
K t . Suppose F £ L 2 (M). Then: 

(a) If F is Holder continuous, with Holder exponent a (0 < a < 1), then for some C > 0, 

\\T t F\\ < Ct a (7) 

for all t > Q. (Here || || denotes sup norm.) 

(b) Conversely, say < a < 1, C > 0, and that F satisfies |?p for all t > 0. Then F is Holder continuous, 
with Holder exponent a. 



The proof of this result will be a straightforward generalization of the argument of Holschneider and 
Tchamitchian (|25j). 



Our construction of nearly tight frames in our sequel article [18] will also rely heavily on Lemma 4.1 In 
another sequel article (already available |19|), we will use Lemma 4.1 to show that one can determine 
whether F is in a Besov space, solely from a knowledge of the size of its frame coefficients. In a future 
article, we hope to study the same question for Triebel-Lizorkin spaces. (The analogous problems on R™ 
were solved in 121 and IT51.1 



It should be noted that in ([6]) we assume nothing about the t derivatives of K t (x,y), and no such infor- 
mation is needed in proving Theorem 
the following improvement on (|6]): 



5.5 



However, in fact, if K t is the kernel of f(t A), one does have 



For every pair of C°° differential operators X (in x) and Y (in y) on M, and for all integers m, N > 0, 
there exists CN %m ,x,Y as follows. Suppose degX = j and degF = k. Then 



t 



d(x,y) 



N 



XYK t (x,y) 



<C N . 



m,X,Y 



(8) 



for all £ > and all x,y e~M. 
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This is in fact an easy consequence of (j6J). For example ,J^-ftr t ( re, y) is the kernel of 2iA/(i 2 A), and hence 
equals 2tA x K t (x, y), which we can estimate by using (K3]). 

We should explain in what sense K t (x,y) (the kernel of f(t 2 A)) deserves to be called a wavelet. In our 
model situation on K n , K t (x,y) = t~ n ip((x — y)/t) behaves in evident, well-known ways under dilation 
and translation: 

K rt (rx,ry) — r~ n K t (x,y) for r > 0; and (9) 
K t (x + a,y + a) = K t (x, y) for a G R" (10) 

Now ([9]) says that, in the model case, K t (x,y) is homogeneous of degree — n in (t,x,y), so that for any 
m,a,0, t n+m+ ' a ' + '^'d^ l d^d^Kt{x, y) is homogeneous of degree 0, and is hence bounded as a function of 
(i, x, y). An analogous fact to this boundedness holds on M, by ([8| with N = 0. 



As for (10 1, a general manifold M has nothing akin to translations, but in Section 5 we shall discuss the 
situation in which M has a transitive group G of smooth metric isometries. In that case one can easily 
see that K t (Tx,Ty) — K t (x,y) for all T 6 G, which is analogous to (To|. (Manifolds with such a group 



G are usually called homogeneous. Obvious examples are the sphere and the torus.) 

Of course, to apply our algorithm in practice, one would need to compute the kernels K t approximately. 
We give some examples where this can be done, in Section 5. In the special case /(s) = se~ s , K t can 
be thought of, very naturally, as a "Mexican hat" continuous wavelet for M. (On M. n , the Mexican hat 
wavelet is a multiple of Ae~ A <5, the second derivative of a Gaussian, the function whose Fourier transform 
is |£| 2 e~'^' ■) In Section 5 we compute these continuous wavelets in the special cases where M is the 
n-sphere S n and the n-torus T n . For the 2-torus, we show that K t can be evaluated by use of either of two 
different sums, one (obtained from the eigcnfunction expansion) which converges very quickly for t large, 
and the other which (obtained from the proof of the Poisson summation formula) converges very quickly 
for t small. (The method can be extended to general n.) For the sphere, the eigenfunction expansion 
again gives a sum which converges very quickly for t large. When n — 2, for t small, by use of heat 
trace methods, we obtain a formula which converges very quickly, and which appears (from numerical 
evidence) to be an excellent approximation to K t . Specifically, in |37j . I. Polterovich obtains a completely 
explicit formula for the heat trace asymptotics on the sphere. (Earlier, less explicit formulae were found 
earlier in ([4]) and (0)-) It is not hard to see that, on the sphere, K t (x,y) := h t (x ■ y) is a function of 
x ■ y. Using Polterovich's result, we show how one can compute the Maclaurin series for 47r/i t (cos 9). In 
this manner we obtain an approximation 

-e 2 /4t 2 2 

47Tft t (cos0) ~ — 5— [(1 - —j)p(t,9) - t 2 q(t,0)}, (11) 



where 



and 



t 2 LV At 2 ' 

, ns t 2 t 4 it 6 t 8 2 A 2t 2 4i 4 4i\ 

p(M) = 1+ 3 + T5 + 315 + 315 + 4 ( 3 + T5 + 105 + 315 ) 

1 2t 2 u 4 u 6 e 2 2 8t 2 4t 4 
q{t > 0) -3 + 15 + TO5 + 3T5 + T ( T5 + 105 + 105 } 



Maple says that when t = .1, the error in the approximation ( 11 ) is never more than 9.5 x 10~ 4 for any 
9 6 [— 7r, 7r], even though both sides have a maximum of about 100. (To obtain rigorous bounds on the 
error is research in progress, which we expect to complete soon.) We derive a similar approximation to 



the heat kernel itself, when n = 2. The method can be extended to general n. Note that, if in (11) we 
approximate p ~ 1 and q ~ 0, we would obtain the formula for the usual Mexican hat wavelet on the real 
line, as a function of 9. 



G 



1.1 Historical comments 

A number of groups of researchers have been studying continuous wavelets and frames on manifolds, and 
some have obtained important real-world applications. While our method, based on the new formula ^ 
is original, certain of the ideas that we have presented in this introduction have arisen in other forms 
before. We now discuss the work of these other researchers. 



Weaker forms of Lemma 4.1 have appeared before; here is the history, as best we can determine it. If 
/ has compact support away from 0, in 1989, Seeger-Sogge [30] showed ^ modulo a remainder term 
that they must handle separately. (We would not be able to handle this remainder in the applications we 
seek.) Next, in 1996, Tao showed (|33]- Proposition 3.1 (ii)) the case j = k = of Lemma 4.1 under the 
restriction that h has compact support. (Recall that h(£) — /(£ 2 )- An assumption that h has compact 
support would not be natural in our context.) 

Most relevantly, in 2006 Narcowich, Petrushev and Ward ([35], [33]) showed a slight variant of Lemma 



4.1 



if M = 5", the sphere, provided / had compact support away from 0. (In their variant, K t (x,y) 
was the kernel not of /(t 2 A) but rather of f(t 2 A4), where M. is a particular first-order pseudodifferential 
operator which is similar to A. Specifically, A multiplies spherical harmonics of degree I by 1(1 + n — 1), 
while M. multiplies them by I 2 . This is a minor distinction, however.) 

Narcowich, Petrushev and Ward do not discuss continuous wavelets, but use spectral theory arguments 
to construct tight frames on S n . They then apply their variant of Lemma |4.1| for purposes similar to ours, 
including characterizations of Besov and Triebcl-Lizorkin spaces through frame coefficients on the sphere. 
Their frames have been dubbed "needlets", and have been used by statisticians and astrophysicists to 
study cosmic microwave background radiation (CMB). (See, for instance, [2], [31], [22] and the references 
therein.) We present a detailed comparison of their approach to frames and ours in our sequel article 

Returning to our own approach, but still restricting to the sphere, the most important cases to consider 
are the case in which / has compact support away from (the "needlet " case, essentially considered by 
Narcowich, Petrushev and Ward), and the case in which /(s) = s r e~ s for some integer r > 1. In our 
sequel article [IB], we construct frames from the latter /; we call these frames Mexican needlets. Needlets 
and Mexican needlets each have their own advantages. Needlets are a tight frame, and frame elements 
at non-adjacent scales are orthogonal. Mexican needlets, though not tight, are nearly tight; they have 
the advantage that one can work with them directly on the sphere, because of the formula pT| . (This 
formula is only for r = 1, but can be readily generalized to general r. As we said before, estimating the 



error in (11) is work in progress.) Assuming this formula, Mexican needlets have strong Gaussian decay 
at each scale, and do not oscillate (for small r), so they can be implemented directly on the sphere, which 
is desirable if there is missing data (such as the "sky cut" of the CMB). 

The statistical properties of needlets were investigated in [2]. Also, the statistical properties of Mexican 
needlets are already being investigated, by the second author in |29j , and by Lan and Marinucci in |28j . 
It would be worthwile to utilize both needlets and Mexican needlets in the analysis of CMB, and the 
results should be compared. 

A number of other researchers have studied wavelets and frames on manifolds. In all of the works 
mentioned below, when orthonormal bases were constructed, they are not known to give rise to a space- 
frequency analysis; and when frames were constructed, they are not known to be tight or to give rise to 
a space-frequency analysis. 
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Let us begin by discussing earlier works on manifolds, which contain some ideas related to those in this 
article. In alphabetical order: 

• Antoine, Vandergheynst, and collaborators ([I], |3J) have constructed smooth continuous wavelets 
on the sphere and related manifolds, by use of stereographic dilations (replacing the usual dila- 
tions), rotations (replacing the usual translations), and spherical convolution (replacing the usual 
convolution). They obtained frames by discrctizing these continuous wavelets. 

• Coifman, Maggioni, and collaborators ([BJ, [3D]) used the heat equation on manifolds for the rather 
different purpose of constructing orthonormal wavelet bases through a diffusion process, leading to 
a multiresolution analysis. They exploit the idea (which they attribute to Stein) of e~ tA being a 
sort of dilate of e~ A . 

• Freeden and collaborators ([TS], (TBJ) defined continuous wavelets on the sphere S 2 , and applied 
them to the geosciences. Their continuous wavelets were of the form f(t 2 A) for various / (not all 
in the Schwartz space), although they did not formulate them in that manner. One of their many 
examples was our Mexican hat wavelet, which they called the Gauss- Weierstrass wavelet of order 



zero. They did not have Lemma 4.1 so they restricted to an (extensive) L 2 theory of continuous 
wavelets. In the context of S 2 , they had, in particular, results equivalent to our Q. 

• Han ([23]i |Mj) constructed frames on general spaces of homogeneous type (including manifolds). 
His method is to discretize a discrete version of Calderon's formula in this general setting. He also 
used the T(l) theorem to estimate errors, 

The following researchers have also worked on wavelets and frames on manifolds. They used methods 
which seem unrelated to those in the present article. In alphabetical order: 

• Dahlke ( 8]) constructed an analogue of Haar wavelets on Riemannian manifolds. 

• Dahmen and Schneider ([S]) have used parametric lifings from standard bases on the unit cube to 
obtain wavelet bases on manifolds which are the disjoint union of smooth parametric images of the 
standard cube. 

• Schroder and Sweldens ([3PJ) used a lifing scheme to build wavelets on manifolds. This lifting 
scheme uses no invariance properties, and regularity information is not easily obtained. 



2 Applying the Spectral Theorem 



In this section, we give the proof of Q, as well as the proof of a discrete analogue, which will be used in 
our sequel article |18j to construct nearly tight frames. (The proofs are quite elementary, and the reader 
who is willing to accept Q can go on to the next section.) Specifically, we shall show: 



Lemma 2.1. Let T be a positive self-adjoint operator on a Hilbert space Tt, and let P be the projection 
onto the null space of T . Suppose I > 1 is an integer, fo £ 5(K + ), fo ^ 0, and let f(s) = s l fo(s). Set 
c=f °°\f(t)\ 2 f. 

(a) Then for any F £TL, 



lim 

*0+.JV^oo 



N dt 



F = c(I- P)F, 



(12) 
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Thus 



\f\ 2 (tT) 



dt 



lim 



/V 



\!\ 2 {tT) 



dt 



exists in the strong operator topology, and equals c(I — P) . 
(b) Suppose that a > (a ^ 1) is such that the Daubechies condition holds: for any s > 0, 

oo 

0<A a < \f(a 2 's)\ 2 <B a <oo, 



(13) 



Then lim 



y~]^__ M |/| 2 (a 2j T) exists in the strong operator topology on "H; we denote this 



limit by E^L-oo \f\ 2 ( a2jT )- Moreove 



A a (I-P)< £ \f\ 2 (a v T) <B a {I-P). 



(14) 



Remark (b) is a discrete analogue of (a), since the sum in (13 1 is a multiple of a Riemann s um f or the 

dt 
I 



integral J °° \}{st)\ 2 ^f = c, while the spectral theorem will show that it is valid to replace s in (j 1 3|) by T, 



to obtain (14 1. 

Proof. We prove (a) and (b) together. Let T — J °° XdP\ be the spectral decomposition of T; thus, in 
particular, P = P{o}- 

Observe that, by the spectral theorem, if g is a bounded Borel function on R + , then ||<7(T)|| < sup s>0 |<?(s)|. 
It follows readily that the integrand in ( 12 1 is a family of operators in B(TC) (indexed by f), which depends 
continuously on t £ [e, N) (in the norm topology on B(H)). (Use the mean value theorem for s in a suitable 



compact interval and the rapid decay of / at oo.) Thus J £ \f\ 2 (tT) y makes sense as a bounded operator 



on H. (As usual, it is defined as the unique bounded operator S with (S(p,^) = f e (\f\ 2 (tT)ip, for 
all ip, ip £ H.) 

For < e < N < oo, define g E ,N ■ [0, oo) — * [0, oo) by 



5e,w(A) 



N 



\f\ 2 (t\) 



dt 
T' 



(15) 



Then 



9s,n(T) = f '\f\ 2 (tT)j. 



(This follows from an elementary application of Fubini's theorem to J s J °° \f\ 2 (t\)d(P\ip, V 1 ) y - ) 
For M, N > we also define /im.jv : [0, oo) — > [0, oo) by 



JV 



(A)= J2 I/IV J ' A ); 

j=-M 



and we also set 



K\)= £ I/I 2 (« 2J ' A )- 



(16) 



j = -oo 
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To prove the lemma it is enough to show that g e n(T) — > c(J — P) (strongly, as e — * + and N — > oo) and 
that Hm,n(T) — > M^ 1 ) (strongly, as M,N — > oo). Indeed, (a) would then be immediate. (6) would then 
also follow at once from ( 13 1, which implies, by the spectral theorem, that A a (I — P) < h(T) < B a (I — P). 

To establish these conclusions, we first note that this strong convergence need only be proved on (I — P)H, 
since all the operators vanish identically on PH. Next we note that for any e,M,N, ||<7 e ,jviloo < c, 
||/iM,Jv|U < B a , and \\h\loo < B a , whence ||<? e ,jv(T)|| < c, \\h M ,N{ T )\\°o < B a , and \\h(T)\\ < B a , 
Thus the needed strong convergence need only be proved on a dense subset of [I — P)TL. 
Set 

V= U P [vM^ 

0<r)<L<oo 

observe that V is dense in (/ — P)TL (since P — P{o})- 

Thus, it suffices to show the following: fix < rj < L < oo, and say F E P\ v j/H. Then g e ^[T)F — > cF 
and }im,n(T)F — > h(T)F. 

This, however, is immediate from the spectral theorem and the evident facts that g e ^N — * c and Hm,n h, 
uniformly on [rj, L]. Although this uniform convergence is easily shown, for later purposes, we carefully 
express it quantitatively in the next lemma. (Note that we may assume a > 1; otherwise replace it by 
l/a.) □ 



Lemma 2.2. Notation as in Lemma 
integer, and let Mj — max r> o \r J f(r) 
Let || || denote the sup norm on I. 

(a) LfO<e< N, then 



2.1, and as in equations (151 through (16 1. Suppose J > 1 is an 
. Suppose < rj < L < oo, and let L be the closed interval [r),L]. 



\g e ,N - 4 < c L e 21 ' ( " 



N 2J> 

where we may take c L = i 2i ||/ o ||L/( 2 0; and C<j = Mj/(2Jr) 2J )- 
(6) If M, N > 0, and a > I, then 



L M.N 



h\\ < 



c 



where we may take c' L = (L 2i ||/ ||^)/(a 4( - 1), and C' n = Mj/[(a 4J - l)r] 2J }. 
Proof. Say s G /. Then 

\9eA»)-c\= f e \f\ 2 (^T+ r^^T- 
Jo 1 Jn 1 

(a) follows from noting |/| 2 (s<) < (Lt) 21 \\ f Q \\^ in the first integral, and that \f\ 2 (st) < M 2 j/(-qt) 2J in the 
second. Similarly, 

i^(*)-^)i= E \f\ 2 (^ S ) + Y / \f\ 2 ^ 2 ^)- 

j<-M j>N 

(b) follows from noting \f\ 2 (a 2: 's) < (La 2 ^ ) 21 \\ foW 2 ^ in the first summation, and that |/| 2 (a 2j s) < 
M 2 /(r]a 2: >) 2J in the second. This completes the proof. □ 

We can now express the strong convergence in Lemma |2.1| in the following very quantitative manner: 
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Lemma 2.3. Notation as in Lemmas 2.1 
sition of T. Then for any F £ H, we have 

\\[g e , N (T)-c)F\\ < 

and, if a > 1, 

\\{h M , N (T)-h(T)}F\\ < 



and 



2.2 



and again let T = XdPx be the spectral decompo- 



se 



C77 



AMI 



7 4ATJ 



|F||+2 C ||(/-P [7)iL] )F||, 



\F\\+2B a \\(I-P {nM )F\ 



(17) 



(18) 



(I — P[ v _l])F m the left side. This gives 



Proof. For (17 1, we need only substitute F 

< || [g e , N (T) - c] P [VtL] F\\ + \\g e , N (T)\\ \\V ~ P[v,L]) F W + C IK J - P Iv,l]) F W 

< \\[g E , N ( T ) ~ c} X[vM \U\F\\+2c\\(I - P [lhL] )F\\, 



|| [g e , N (T)-c]F\\ 

which, because of Lemma |2.2[ establishes (|17j) . The proof of (jl8|) is entirely analogous. 



□ 



( 17 1 and ( 18 1 are of significance for numerical calculations. Say for instance, that F — (I— P)F, and one 
wants to compute h(T)F to a certain precision. This involves summing an infinite series, so one instead 
seeks to compute [hM,N(T)]F for M,N large enough; how large must one take them to be? One first 
chooses 77, L so that the second term on the right side of (18 1 is very small. Then one chooses M,N to 
make the first term on the right side of ( 18 1 very small as well. 

We will return to this point in our discussion of space-frequency analysis, in our sequel article [18 . 



3 Preliminaries on Manifolds 

For the rest of the article, (M, g) will denote a smooth, compact, connected, oriented Riemannian manifold 
of dimension n, and fi will denote the measure on M arising from integration with respect to the volume 
form on M. In this section we assemble several well-known facts about analysis on M (preceded, below, 
by bullets), which we shall need in this article and in sequel articles. 

For x, y £ M, we let d(x, y) denote the infimum of the lengths of all piecewise C 1 curves joining x to y; 
then (M, d) is evidently a metric space. It is well-known (see, e.g., [34]) that there is a geodesic joining 
x to y with length d(x, y), but this fact, though basic, is not so relevant for this article. Most of what we 
need to know about the metric d is contained in the simple proposition which follows. 
For x £ M, we let B(x, r) denote the ball {y : d(x, y) < r}. 

Proposition 3.1. Cover M with a finite collection of open sets Ui (1 < i < I), such that the following 
properties hold for each i: 

(i) there exists a chart (Vi,<fii) with Ui C Vi; and 
(ii) 4>i(Ui) is a ball in M. n . 

Choose S > so that 3<5 is a Lebesgue number for the covering {Ui}. Then, there exist c\,C2 > as 
follows: 

For any x £ M, choose any Ui 3 B(x,35). Then, in the coordinate system on Ui obtained from 4>i, 

d(y,z) < c 2 \y- z\ 

for all y, z £ Ui; and 

ci\y~z\ < d(y,z) 

for all y, z £ B(x,5). 
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Proof. Say y, z G Ui. We work in the coordinate system on Ui obtained from </>,-. Then d(y, z) is at most 
the length (in the Ricmannian metric) of the straight line joining y to z, which is < ci\y — z\ for some 
C2- (By assumption (ii), that straight line is contained in Ui.) On the other hand, if y, z € B{x, 5) C Ui, 
we may take a sequence of piecewise C 1 curves 7^, joining y to z, whose lengths l(jk) approach d{y,z) 
as k — * 00. Surely d(y, z) < 28. Thus, for some N > 0, if k > JV, then l(jk) < 2<5. Therefore each point 
on 7^ is at distance at most 28 from y, hence at most 3(5 from x. Thus 7^ C Ui. Letting ||7j.(t)|| denote 
the length of the tangent vector j' k {t) in the Riemannian metric, we see that 

1(7*)=/ H(t)\\dt> Cl f \i k (t)\dt> Cl \z-y\, 



since z — y = ' r y' k (t)dt. Letting k — > 00, we see that d{y,z) > C\\y — z\ as well. This completes the 
proof. □ 



We fix collections {Ui}, {Vi}, {&} and also 5 as in Proposition 3.1 once and for all. 



• Notation as in Proposition \3.1\ there exist €3,04 > 0, such that 

c 3 r n <n(B(x,r))<c 4 r n (19) 
whenever x S M and < r < 5, and such that 

c 3 S" < y.{B{x, r)) < /i(M) < c 4 r n (20) 

whenever x G M and r > 5. 



To see (19 1, note that, since the collection {Ui} is finite, we may fix i and prove it for all x 
with B(x,35) C Ui. 

We work in the coordinate system on f/, obtained from <f>i\ in that coordinate system, Ui is a 
Euclidean ball, say {y : \y — xq\ < R}. (See Proposition 3.1 1. By compactness and a simple 
contradiction argument, there is an 77 > such that, for all x with B(x,3S) C {/,, one has that 
|z — xo| < R~ rj. Accordingly, for such an x, if |y — x\ < 77, then y G [/,. 

Thus, by Proposition |3.1[ we have that 



{y:\y-x\< min(r/c 2 , rj)} C B(x, r) C {y : |y - x\ < r/cx}, 



for all r < 6. ( 19 1 now follows from the fact that the determinant of the metric tensor g is bounded 
above and below on Ui. For (20 1, one need only note that if r > 6, then /i(B(x,r)) > /i(B(x,5)) > 
c 3 S n , while n(B(x,r)) < n(M) < \jj,(M)/S n ]S n < [fx(M) / d n ]r n . 

• (M,d, fx) is a space of homogeneous type, in the sense of 

Indeed, d is a metric and fi is a positive Borel measure, so one only needs to check the dou- 
bling condition: fi(B(x, 2r)) < C[i(B(x, r)) with C independent of x, r. But this is immediate from 
([19) and ([20). 

• i^or any N > n there exists Cn such that 

[l + d(x,y)/t]- N d t i(y)<C N t n (21) 

M 
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for all x G M and t > 0. 



(21 1 is proved by the "dyadic annulus" method. Fix x,t and let Aj — B(x,2H) \ B(x,2^ 1 t), 
so that, by (19) and (201, fi(Aj) < c^2 n H n . (21) now follows at once, if one breaks up the integral 
in (21 1 into integrals over B(x, t),A\,A%, . . ., and notes that Y^jLo 2^ n ~ N ' >: ' < oo. 

• For any N > n there exists C' N such that 



d(x,y)- N d^y)<C' N t 



d(x,y)>t 



i .n—N 



(22) 



for all x G M and t > 0. 



(22 1 follows at once from (21 1, once we observe that, if d(x, y) > t, then 
[d(x, y)/t]- N < C[l + d(x, y)/t]- N , if C = 2 N . 

For any N > n there exists C'^ such that 



[1 + d(x, z)/t]- N [1 + d(z, y)/t)- N d^{z) < C%t n [l + d(x, y)/t] 



1-iV 



(23) 



M 



for all x, y G M and t > 0. 



To see this, break up the integral into integrals over H\ — {z : d(x,z) < d(y,z)} and H2 = 
{z : d(x, z) > d(y, z)} (which, by the way, are hemispheres if M is a round sphere and x ^ y). 
By symmetry we need only estimate the integral over H\. But if z is in H\, d(x,y) < 2d(z,y), so 
[1 + d(z, y)/t]~ N < C[l + d(x, y)/t]~ N (where C = 2 N ). Thus the integral over Hi is no greater 
than C[l + d(x, y)/t]~ N J H [1 + d(x, z) /t]~ N dfi(z)). Estimating the latter integral through (21 1, we 
obtain ( 23 1 . 



• For all M, t > 0, and for all E C M with diameter less than Mt, if x$ S E, then one has that 
1 



M + l 

for all x £ E and all y G M. 

This is true simply because d is a metric 



[l + d(x,y)/t] < [l + d(x ,y)/t] < (M +1)[1 + d(x,y)/t] 



(24) 



4 Kernels 

A will now denote the Laplace-Beltrami operator on M (equal to —d*d, where * is taken with respect 
to the given Riemannian metric). We apply Lemma |2.1| to T = A. 

In order to carry out the plan explained in the introduction to this article, we must study the kernel 
K ^{x, y) of the operator f(tA) for / G 6>(IR + ), and we do so in this section. Before proving the crucial 
Lemma 
diagona 



4.1 



we will present some well-known information about K^.{x,y) for large t and also, off the 
for small t. (This information will be preceded, below, by t> signs.) 
Concerning the Laplace-Beltrami operator A, we first recall: 
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• A, as an operator on C°°(M), has an orthonormal basis of eigenfunctions {ui : < I < 00} ; all 
are in C°°(M). We may, and do, choose the ui to be real- valued. We order the ui so that the 
corresponding eigenvalues A; form a non-decreasing sequence. Then uo is constant, Ao = and all 
other A/ > 0. 

This easily implies: 

• A, as an operator on C°°(M), has a unique extension to a self-adjoint operator on L 2 (M) (which 



we also denote by A). Its domain is {F = J2 a i u i <= L 2 (M.) : l-^ a ;| 2 < an d for such an F, 
AF = J^haiui. 



Of great importance is Weyl's Lemma, which says [26 , in sharp form: 

• For A > 0, let N(X) denote the number of eigenvalues of A which are less than or equal to A 
(counted with respect to multiplicity). Then for some c > 0, N(X) = cA™/ 2 + 0(X^ n ~ 1 ^ 2 ). 

Since N(Xi) = 1 + 1, we conclude: 

• For some constants c\, C2 > 0, we have c\l 2 ^ n < A/ < cil 2 l n for all I. 

Since A m ui = Xf'Ui, and A m is an elliptic differential operator of degree 2m, Sobolev's lemma, 
combined with the last fact, implies: 

• For any integer k > 0, there exists Ck, > such that ||iti||c fc (M) — Ck(l + \) Uh . 

(In fact, by Sobolev's lemma, we may, for any e > 0, take Vk = (2k + n + e)/2n. By [41] , Lemma 
4.2.4, with A — A; in that lemma, we may in fact take vq = (n — l)/n.) 

From these facts one sees at once: 

• The mapping ^a;it/ — > (a;);>o gives a Frechet space isomophism of C°°(M) with the space of 
rapidly decaying sequences. 

For the rest of this section, say / G 5(M + ). We conclude: 

t> For t > 0, x, y £ M, let (for the rest of this section) 



DC 




(25) 



z=o 



Then K^q is the kernel of the operator f(tA), in the sense that if F S L 2 (M), then 



[f(tA)F](x)= [ K Vi (x,y)F(y)d f i(y), 



and K^i(x, y) is smooth in (t, x, y) (for t > 0, x, y £ M). 

From <\25\ , our estimates on the A; and the ||uj||cr*, and the rapid decay of Schwartz functions, 
we conclude: 
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D> If /(O) = 0, then for any M, N > 0, 



qN 



in G°°(M x M). 

Note also, that whenever I > 0, J M U;d/U = C J M iiiu^d/i = 0. Accordingly: 

> If /(0) = 0, then J M K^(x, y)dfi(x) = for all y e M, and / M K^(x, y)dfi(y) = for all x e M. 

Next we discuss the behavior of ATyj, as t — ► + . For this, we utilize some of the theory of pseudodiffcr- 
ential operators. 

We will need some facts about symbols on M. For m e R, let 5" l (]R) denote the space of standard 
symbols of order m, which depend only on the "dual variable" £. Let {|| || m ,jv} denote the natural 
nondccreasing family of scminorms defining the Frcchet space topology of S'j"(R); thus 

\\ P \\m,N= £ ™v[{l + \Z\) j - m \P U \0\ 
0<j<N t 

for p e SJ"(R). If G e 5 1 ™ (R) , let G t (£) = G(i£). It is evident that G t e 5™(R). In fact wc have: 

• Say i/iai G € S'j Tl (R). Lei fc &e the least integer which is greater than or equal to m; if k > 0, 
suppose further that G vanishes to order at least k at 0. Then for any N , there exists C > such 
that 

\\G t \\ m , N < Ct m (26) 

whenever < t < 1. 
To see this, say j > is an integer. One needs only note: 
(i) if < j < m (so that fc > 0) and t\£\ < 1, then 

|(G t ) (j) (OI < Gt J '(i fe - J ^| fe - J ) < Ct j {t m - j \t\ m - j ) < Ct m (l + |£|) m ~ J ; 
(m) if < j < m and > 1, then 

\{G t ) U \0\ < Ct j {l + t\£\) m - j < Ct j {t m - j \£\ m - j ) < Ct m (l + \£\) m -3; 

(Hi) if j > m, then 

l(G t ) ( %)| < CV'(1 + |^|)- (j - m) < Gi m (l + |e|)- (j - m) . 

This proves the claim. 

In particular, say G € (R) is arbitrary. Then G — G(0) e (R), and vanishes to order at least 1 
at 0. Accordingly, for any N, \\G t - G(0)||i ;iv < Ct for < t < 1, so that G t -» G(0) in 5f (R) as t -» 0+. 

We return now to M, where we need to look at the class OPS™ (M) of pscudodifferential operators 
of order m e [-00,00). As is familiar, T : G°°(M) -> G°°(M) is in OPS^ (M) provided that the 
following conditions hold for ip, ip <G G°°(M): 
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1. If swppip n suppip — 0, then the operator ipTip has a smooth kernel; and 

2. If supp^ U suppV> is contained in a chart (V, $), then ipTip is the pullback to M of a pseudodiffer- 
ential operator <S>*(ipTiJj) G OP5^ (K"). 

(Of course, here, (ipTtp)F — ipT(i/jF).) One places a Frechet space structure on OPS l f l (M) in a natural 
manner. (A brief sketch: First note that 0P5 _o °(M) is the space of operators with smooth kernels, so it 
has a natural Frechet space structure, inherited from C°°(M x M). For other m, one chooses a finite atlas 
{VF/j} on M with the property that if two charts in the atlas intersect, their union is contained in a chart. 
One chooses a partition of unity {fk} subordinate to this atlas. One notes that if T G OPS" l (M. n ), then 
T = J2ij <Pi Ti Pj- One notes that if W t n Wj = 0, then ipiTipj G OPS~°°(M), a Frechet space; while 
if Wi n Wj ^ 0, then W, nWj CV for some chart (V, and <&*{<pTip) G OPS^ (ffi"), also a Frechet 
space. Finally one defines seminorms on OPS , ™ (R ra ) of the form J2i j llVi^Vjlli where, in the summa- 
tion, one uses appropriate seminorms coming from OPS~°°(M.) if Wi fl Wj = 0, or from OPS] n (M n ) if 
Wj n Wj 0. The Frechet space topology thereby placed on OPS , ™ (M) is independent of all choices 
made.) 

One has the following theorem of Strichartz ([43 , or Theorem 1.3, page 296, of [45J: 

• Ifp{Q G SJ"(R), thenp(yfA) G OP5^ (M). 

In fact, the map p — ► p(VA) is continuous from 5™(M) to OP5™ (M). Indeed, by the closed graph 
theorem for Frechet spaces, it is enough to observe that if u G C°°(M), then the maps p — > (p(VK)u, ui) 
are continuous from 5J n (M) to C for every I, and this is clear. 

As a consequence, if ip,ij) £ C°°(M) are as in #1 above, then the map from 5™(M) to OPS~°°(M), 
which takes p to ipp(y/A)^, is continuous. If if, tp G C°°(M) are as in #2 above, then the map from 
55" (K) to OPS^ (R n ), which takes p to $*(<£p(VA)V0 is continuous. 

As usual, / G 5(R+); let G(£) = /(£ 2 ). Then G G 5(E). (In fact, if we allow / to vary, the map 
/ — > G is evidently a bijection between 5(]R + ) and the space of even Schwartz functions on K.) Now 
G^(0 = /(t£ 2 ), and G^(\/5) = f(tA). Since G^ -► G(0) in S\(M.) as i -> 0+, we infer: 

• /(tA) -» /(0)J zn OP5 1 1 i0 (M) as f -»■ 0+. 

Let £) denote the diagonal of M x M. We can now show: 

t> For any N > 0. 



m G°°((M xM)\D) 

To prove this, we adapt the arguments of [IS], page 313. Say <p, tp G C°°(M), have disjoint supports. 

Suppose further that </? = 1 in an open set U and ip = 1 in an open set V. 

It is enough to show that, for any G°° differential operator f onM, acting in the y variable, 

QN 

lim Y—-^K^t(x, y) (regarded as a function of x) = 



lim 77 — TT -/^ /7 



1G 



in C°°(U), uniformly for y £ V. 
But if x £ U and y £ V, then 



Y &N K Vi(x,y) = <p{x) [Yf^(tA)^A N S y )\ (x) = [S t (w y )} (x), say , 
where St is the pseudodifferential operator tpYf( N \t A)i/j, and w y — A N 5 y . 

For some s > 0, the set {w y : y £ V} is a bounded subset of i?~ s (M). Also, as t -> 0+, f {N) (tA) -> 
/W(0)7 in OP5 1 1 -0 (M), so F/W(tA) -> /W(0)Y in OPS'f+^M), if jfc = degF. But have 
disjoint supports, so the map i? — > ipRtp is continuous from OPS'j C g 1 (M) to OPS f 1 ~^° (M) . Therefore 
ft -»■ [/ (Ar) (0)^] ^ = in OPS^(M). Thus: 



S" t w y in C°°(M), uniformly for y £ V, 



as desired. 



Applying the mean value theorem in the t variable repeatedly to the last fact about K^, we see: 

t> Let E be any fixed compact subset of (M x M) \ D, and let U be the interior of E. Then for any 
k, N there exists Ck,N such that 

\\ K ^i\\c k {U) < Ck,Nt N 

whenever < t < 1. 

So far, nearly everything in this section has been well-known, but now we must consider the behavior of 
K t near the diagonal for small t. As we have explained and motivated in the introduction, this behavior 
is described by Q: 

Lemma 4.1. Say /(0) = 0. Then for every pair of C°° differential operators X (in x) and Y (in y) 
on M, and for every integer N > 0, there exists Cn,x.y as follows. Suppose degX = j and deg Y = k. 
Then 



d(x,y) 



t 



N 



XYK t (x,y) 



<C 



N,X,Y 



(27) 



for all t > and all x,y £ M. 

Proof. Of course d is bounded on M x M. Thus, by what we already know about K t , it suffices to 



prove (27) for < t < 1. In fact, with notation as in Proposition 3.1 it suffices to show that (27 1 holds 



whenever < t < 1 and d(x,y) < 5/3. Cover M by a finite collection of balls {B(zi,S/3) : 1 < I < J}. 
Then any pair of points (x,y) with d(x,y) < 6/3 lie together in one of the balls B(zi, 28/3). Thus, it 
suffices to show that (27) holds whenever < t < 1 and x,y £ B(zi, 25/3) for some I. Moreover, we may 
fix a positive integer M and prove that (p7| holds for all TV < M. 



Now K t is the kernel of f(t A) = G(tvA), where G(£) = /(£ )■ We claim that it is enough to prove 



(27 1 in each of the following two cases: 



(i) suppG C (—1,1); and 



(ii) G vanishes to order at least M at 0. 
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Indeed, any even G in S(R) can be written as the sum of two even functions G\ and G2, where G\ 
is of type (i) and G2 is of type (ii). (To see this, say that, for < I < M — 1, G^(0) = a;. It is 
enough to show that there exists an even function G\ with suppGi C (—1, 1), such that G\ (0) = a/, 



for < / < M—l, for then we can set G2 = G — G±. For this, see Lemma 7.1 in the Appendix (Section [7]).) 



In case (i), note that, by Huygens' principle, the support of K t , the kernel of 

/oo 
G{s)e- tst ^ds, 
-00 

is contained in {(x,y) : d(x,y) < t}. Thus, in this case, we may take M = 0. 

In either case (i) or case (ii), it is sufficient to show that, for every (f,ip 6 C^°(B(zi, 8)), we have that 



j-n+j+k 



d(x,y) 



N 



\XY [^{x)K t {x,y)^{y)]\<C 



whenever degX = j and degF = k, for all < t < 1, all N < M, and all x, y € B(zi,5). (Indeed, we 
could then take (p, ip = 1 on B(z[ 7 25/3).) Select Ui as in Proposition |3.1| with B(zi,35) C U i . Now, 
(p(x)K t (x, y)ip(y) is the kernel of the pseudodifferential operator <pG(W A)ip. We can use the coordinate 
map 4>i to pull this kernel over to M n , thereby obtaining a smooth, compactly supported kernel L t , with 
support in W 1 x M. n . Let us change our notation and now use x and y to denote points in M. n . By 
Proposition 3.1 it is enough to show that: 



i+M+i/31 ( \ x - y\ 



N 



\^L t (x,y)\<C 



for any multiindices a, (3, for all < t < 1, all N < M, and all x, y £ K™. 
Now let pt(x,Q denote the symbol of the operator with kernel L t . Then 



Lt{x,y) 



i(y-x)- 



Thus, d"d!?Lt(x, y) is a finite linear combination of terms of the form 



T 



(28) 



where | 7 |, \S\ < \a\ + \(3\. 

In case (i) we may take M — 0, so we need only estimate \T\, the absolute value of the term T in (28). 
It will be enough to show that \T\ < Ct- n -^ (for < t < 1), since Ct~ n -^ < Ct~ n -^-^. But 



ITI < 



\€\<i/t 

< C 

< C 



\?d 5 xPt (x,0\dt + 

A t t- n -^ + B t 
A t t- n -^ +B t t 



l«l>i/* 



„|7|+n-l r -M-»-l 
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where A t — sup x ^ \d^pt(x,£)\, and B t — sup^^ |£|' 7 ' +n+1 |<9fp t (:z:, But, by (26 1 and the continuity of 
the map p — > p(VA), from to OPS , ™ (M) in the cases m = and m = +n + 1), we find 

that A t < C (independent of < t < 1) and B t < Ci-H - "- 1 . Altogether |T| < Cr^-M, as claimed. 
This completes the proof in case (i). 

In case (ii), we need only show that for every n-tuple v with \v\ < M, we have that \(x — y) v T\ < 
Cft-n-M+M. Note that (x - y)"e 1 ^-^^ = cd^e t( - y -^^. Substituting this in the explicit expression for 
(x — y) v T, and repeatedly integrating by parts in £, we see that (a; — y) v T is a finite linear combination 
of terms of the form 

where |k| < | — ^ | , |x| < \v\ < M, and \j\ — \k\ + \\\ — \v\- Just as in our estimate for T above, we see that 



IT'I < 



< c 



^d d x df Pt (x,o 

r 
Ji 



'ifi>v* 

r |K|+ri-l r -|K|-ri-l^ r 



/* 



< c 



where now 



and 



B, 



A t = sup 

sup|e| |K|+ " +1 

x4 



Att -n-\ K \ + Btt 

d 5 x df Pt (x,o = su P (i + | ? |)-ixi+ixi a£d*pt(x,i) 



d 5 x d£p t (x,Z) < su P (l + l^l+^-M+M 
x4 



d 5 x dfp t (x,0 



to OPSf (M) in the cases m 



X\ 



But, by (26 1 and the continuity of the map p — > p(yA), from S" 
and m = |x| - (|«| + n + 1), we find that A t < Ct^ and B t < Cil*H«l-n-i. may be used here, 

since G vanishes to order at least M at 0, and both \x\ and |xl — + n + 1) are less than or equal to 
M.) Altogether \T'\ < Ct-"+lxl-l K l = Ct~ n -^+^, as claimed. This completes the proof. □ 



Remark 4.2. Note that, in Lemma 1^.1, the conclusion (21) holds even without the hypothesis /(0) = 0. 



provided t is restricted to lie in the interval (0, 1]. Indeed, after the second sentence of the proof of the 



lemma, we assumed < t < 1 and never used the hypothesis that /(0) = 0. Of course (21) holds also for 
t = 1 by continuity. 



5 Continuous S- Wavelets on Manifolds 

We now turn to our definitions of continuous wavelets and continuous 5-wavelets on M, which we have 
motivated in the introduction. 

Definition 5.1. Suppose that the function K t (x,y) is smooth for t > 0, 1,1/6 M. For t > 0, define 
T t : L 2 (M) -> C°°(M) to be the operator with kernel K t , so that for all F G L 2 (M) and all x € M, 

(T t F)(x)= f Kt(x,y)F(y)dfi(y). 

JM 

As usual, let P denote the projection in L 2 (M) onto the space of constant functions. Then we define 
K t {x,y) to be a continuous wavelet on M, provided the following three conditions hold, for some c > 0: 
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(i) For all F e L 2 (M), 



\T t Ff 2 j = c\\(I-P)F\\l; (29) 



(ii) J M K t (x,y)dii(y) = for all t > and all x e M (or, equivalently, T t (l) = for all t>0); 

(Hi) J- M K t (x,y)d[i(x) = for all t > and all y G M for, equivalently, T*(l) = /or a// £ > Oj. 

Definition 5.2. Suppose K t (x,y) is a continuous wavelet on M. We then say that K t (x,y) is a contin- 
uous S-wavelet on M, if the following additional condition holds: 

(iv) For every pair of C°° differential operators X (in x) and Y (in y) on M, and for every integer 
N > 0, there exists Cn,x,y as follows. Suppose degX = j and deg Y" — k. Then 

t n +^ k \(^f^) N XYK t (x,y)\ < C N , x .y (30) 

for all t > and all x, y £ M. 
We then have the following result: 

Theorem 5.3. Say f Q E S{R+), f ^ 0, and let f(s) = sf (s). For t > 0, let K t be the kernel off(t 2 A). 
Then K t (x,y) is a continuous S-wavelet on M. 



Proof. Of course, condition (iv) is Lemma 4.1 As we have seen, conditions (ii) and (iii) of Dcfinitio 5.1 
are immediate consequences of (251, as for condition (i), say F € L 2 (M). We need only take the inner 
product of both sides of (12 1 with F to see that, if c = / °° \f{t)\ 2 Y, then 

J OO \\f(tA)Ff 2 ^ = c\\(I-P)F\\l 

Replacing t by t 2 in this equation, we find that 

^ ||/(t 2 A)F||^ = |||(/-P)F|| 2 2 , 

which yields condition (i) at once. This completes the proof. □ 

As for properties of continuous wavelets, we first remark that it is a standard, simple matter to show the 
following result, which generalizes (12 1 (with T = A there): 

Proposition 5.4. Suppose K t (x,y) is a continuous wavelet on M, and, for t > 0, let T t be the operator 
on L 2 (M) with kernel K t . Then for any F € (I — P)L 2 (WL), we may reconstruct F through the identity 

T*T t Fj = cF. (31) 



Here the integral on the left side of (31) conveges unconditionally in L . 

Proof. Let Hi be the Hilbert space (7 - P)L 2 (M), and let H 2 be the Hilbert space L 2 (R+, Hi,dt/t). By 
our definition of continuous wavelet, we may define a bounded operator U : Tii — > H.2 by 

UF = (T t F) t>0 . 



20 



Moreover, we may define a bounded operator V : H.2 —> 'Hi by 

V(G t ) t>0 -■ 



00 dt 



where the integral converges uncondtionally in Hi. Indeed, let || || denote || Wh^ an d let S = {F e Hi 
\\F\\ = 1}. If E C (0, 00) is measurable and contained in a compact subset of (0, 00), we have 



T t *G t ^\\ = S np\ f (T t *G t ,F)^\= S up\ f (G u T t F) 
1 Fes Je t Fes Je 



dt 

7 







1/2 






< 


[/jlcllf 




sup | 








.Fes 



1/2 



but, by (29|, this is less than or equal to [f„ ||Gt|| 2 ^] 1 / 2 ; and the unconditional convergence follows. One 
now readily checks that V = U*. By |29l, (U*UF,F) = c\\F\\ 2 for all F £ Hi. Polarizing this identity, 
we find (31 1, as desired. □ 

As an example of the usefulness of continuous 5-wavelets, we now show the following direct analogue of 
a theorem of Holschneider and Tchamitchian ([251*): 

Theorem 5.5. Let K t (x,y) be a continuous S-wavelet on M, and, for t > 0, let T t be the operator on 
L 2 with kernel K t . Suppose F 6 L 2 (M). Then: 

(a) If F is Holder continuous, with Holder exponent a (0 < a < 1), then for some G > 0, 



|T t F|| < Ct° 



(32) 



for all t > 0. (Here || |j denotes sup norm.) 

(b) Conversely, say < a < 1, C > 0, and that F satisfies (32) for all t > 0. Then F is Holder 
continuous, with Holder exponent a. 



Proof. For (a), we just note: 

\(T t F)(x)\ = 



F(y)K t (x,y)dfi(y) 
{F{x)-F{y))K t {x,y)dn{y) 



<Cr n I d(x,yr[l + d(x,y)/t]- n - 1 - a d t x(y) 



< Ct a ~ n 

< Ct a 



-l + d(x,y)/t}- n - l dn(y) 



by (J21J, as desired. 

For (b), of course PF, being constant, is Holder continuous of any exponent. Since T t l = 0, we may 
assume that F = (I - P)F. 
Set g t = T t F, so that 

\\g t \\<ct a . (33) 

For x,y £ M, set 

K t!X (y)=Kf(x)=K t (x,y). 
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Thus, for all x, gt{%) — (K t , x ,F). Since we are assuming F £ L 2 , by Cauchy-Schwartz and ( 30 > , we 
obtain the additional estimate 

11.9*11 < Ct~ n . 

By Q, we have that \K t (x,y)\ < Ct~ n (l + d(x,y) /t)^ 1 , so by (pTJ, 



for any y. Thus, for any y, 



m\i<c 



\(T t *T t F)(y)\ = \(g u K«)\ < \\g t \\ \\Kfh < C\\g t \ 



Accordingly, by (33) and (34 1, for any y, 



\(T t *T t F)(y)\j<C 



t a_1 dt • 



r n_1 dt ) < c. 



By Proposition |5.4| we now see that for almost every y, 

f°° dt 
cF(y) = ^ T t *T t F(y)j, 

and from this, that F £ L°° . 

To complete the proof, we claim that it suffices to show that if d(y, z) < min(i, S), then 



\K t (x, y) - K t {x, z)\dfx(x) < Cd(y, z)/t. 



M 



For then, by (33 1, (34), (35 1 and (36 1, we would have, if d(y, z) < 5, then 

c\F(y)-F(z)\ < r [ \K t (x,y)-K t (x,z)\dn(x)\\g t \\^ 
Jo Jm. l 



C 



d(y.z) 



t a - l dt + d{y,z) / t a ~ 2 dt + d(y,z) I t 

d(y,z) 



-n-2 



dt 



< Cd(y,z) a , 



(34) 
(35) 



(36) 



as needed. 

To prove (36 1, choose [/,; 2 B(y, 36), and let us work in the local coordinates on Ui obtained from We 
use the mean value theorem. By (30 1, for any x £ M, there is point w x on the line segment joining y to 
z such that 

\K t {x, y) - K t (x, z)\ < C\y - z\r n -\l + d(x, w x )/t)- n -\ 



By Proposition |3.1 1 if w is any point on that line segment, 

d(y, w) <c 2 \y-w\< c 2 \y - z\ < c 1 c 2 d{y, z) < c x c 2 t. 



Thus the diameter of the line segment is at most 2cic 2 t, and so, by (24 1, we have 
\K t (x, y) - K t (x, z)\ < Cd(y, z)^ 1 ^ + d(x, y)/*)""" 1 . 



(36 1 now follows from (21), as desired. 



□ 
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6 Homogeneous Manifolds 

In this section look at the situation in which M has a transitive group G of smooth metric isometries. 
(Such manifolds are usually called homogneous manifolds.) Obvious examples of such manifolds are the 
sphere S n , where we take G to be the group SO(n + 1) of rotations, and the torus T n = (S 1 ) 71 , where 
we take G to be the group [SO(2)] n . 

If T G G and F is a function on M, we define the function TF on M by (TF) (x) = F(T- X x). Then 
T : L 2 (M.) — ► L 2 (M.) is a unitary operator which commutes with the Laplace-Beltrami operator A. 
Consequently, as operators on L 2 (M), f(t 2 A) commutes with elements of G for any bounded Borel 
function / on K, and in particular, if / G <S(K), which we now assume. 
Thus, if T G G, F G i 2 (M) and x G M, we have 

/ K t (Tx,Ty)F(y)dn(y) = f K t (Tx,y)F(T- 1 y)d^(y) 

= [f(t 2 A)(TF)](Tx) 
= T([f(t 2 A)(F)])(Tx); 

but this is just [f(t 2 A)(F)](x) = J M K t (x,y)F(y)dfi(y), so 

K t (Tx,Ty) = K t (x,y) (37) 

for all x,y £ M. Thus, since G is transitive, if xq is any fixed point in M, once one knows K t (xo,y) for 
all y. then one knows K t (x,y) for all x.y. 

In the analogous situation on K ra , K t (x, y) has the form t~ n ijj((x~y)/t) for some ip £ S, and so K t (x, y) = 
K t (Tx,Ty) for any translation T on K n . Equation (37 1 is a natural analogue of this fact for M. 
It is interesting to note that one has 

K t (x, x) = tr(/(t 2 A))/vol(M) (38) 



for all x and all / G 5(M + ). Indeed, by (37 1, K t (x,x) is constant for x G M. Accordingly 

vol(M)K t (x,x) = [ K t (y,y)d^(y) = J2f(t 2 ^) [ \ui{y)\ 2 d^y) = V /(*%) = tr(/(i 2 A)) 

,/M ( JM j 

as claimed. 

Say now c > 0, and let us look at the special case 

f{s) = {s/c)e-^ c \ (39) 

We have tr(/(t 2 A)) = tr((£ 2 /c)Ae~(' 2A )/ c ). A well known fact, usually associated with the heat kernel 
approach to index theorems ([55], [20], [57] and [2T], pages 58 and 316), is that as s — > + , 

CO 

tr(e" sA ) ~ ^ s m - n/2 a m , (40) 

?n=0 



where 



a = (47r)-" /2 vol(M). 



(41) 



23 



Differentiating with respect to s one finds that 



tr(Ae- A )^^(^-m) S m - 1 -"/ 2 a m . 

m=0 

To lowest order, then, 

K t (x,x) = tr((t 2 / C )Ae- ( * 2A)/c )/vol(M) ~ ^S,- (42) 

Again let / be general, but now let us look at the special case M = T", the torus. We write T" — 
{(e 2, ™ ri , . . . ,e 27Tlr ") : — 1/2 < r\, . . . ,r n < 1/2}. Here an orthonormal basis of eigenfunctions is given 
simply by { e 27rim r : m £ Z"}. The Laplace-Beltrami operator A is just — 'Y^a =1 {d / dri) 2 , and the 
eigenvalues are given through Ae 27 ™ m ' r = Air 2 \\m\\ 2 e 2lTlm ' r . (Here ||m|| 2 = m\ + ...m 2 .) The kernel 
K t (r, s) of f(t 2 A) is given by 



Mr, a)= J2 /(4^ 2 |MlV 



|2^ p 27rim-(r'-s) 

Thus, if Fe L 2 (T n ), 

,1/2 ,1/2 

[f(t 2 A)F](r) = / . . . / F(s)Mr, • ■ ■ da n = [F * h t {r)\ 

J -1/2 J-l/2 

where 

h t (s)= /(47r 2 < 2 ||m|| 2 )e 2 — s , 

m6(Z) n 

and * denotes the natural convolution on T n . 

We specialize now to the case n = 2, /(u) = ue _u / 47r /47r 2 . We free the letter n for other uses. For t > 
define the functions £/*, Vt : R — > C by 

oo 

n— — oo 

oo 

F t (y)= {nt) 2 e- wt2n2 e 2 ™ y . (44) 

n— — oo 

It is then easy to calculate that 

h t (si,s 2 ) = U t { Sl )V t {s 2 ) + U t {s 2 )V t ( Sl ). 

h t is the "Mexican hat" on the torus T 2 . One can use these equations to draw its graph. One of 
course needs to approximate the series in (43 1 and (44) by finite sums. This is a simple matter if t 
is greater than 1, but if t is small the series do not converge very quickly. Fortunately, however, we 
can give alternative series expansions for U t and V t which do converge very quickly for < t < 1. We 
do this by using the Poisson summation formula, or rather, its proof: if g S 5(M), then the periodic 
function G(x) = Y^=-oo9( x + n ) nas Fourier series X) n =-oo g(n)e 2mnx , and hence, G(x) equals the 




Figure 1: h t on T 2 for t = 2 (left), t = 1/2 (middle), t = 1/8 (right) 



latter series. (Here we use the inverse Fourier transform g(x) — J_ g(£,)e ^"^d^.) Applying this with 
g(y) = e _7r * 2;/2 , we obtain the formula 

oo 

U t (x) = ~ e~< n+ ^^. (45) 

n— — oo 

Taking instead g(y) = (tyYe^^y 2 , we obtain the formula 

1 °° 1 _L 

V *M = \ E - (^) 2 )e-<^) 2 /* 2 . (46) 



( 45 1 and M6J converge very quickly for t small, making it practical to draw pictures of ht for t small. We 
include pictures, obtained by using Maple, of the Mexican hat functions nht(ri,r2) —1/2 < rx,T2 < 1/2, 
for t = 2 (Figure 1, left), t = 1/2 (Figure 1, middle) and t = 1/8 (Figure 1, right). Note that the 
characteristic Mexican hat shape is obtained. To keep the pictures uncluttered, we have omitted the 
axes. However, one can ask Maple to evaluate t 2 Trht(0, 0) for various values of t. When t = 2, it is .00070; 
when t = 1, it is .59017; when t = 1/2, it is .99984, and when t = 1/8, it is 1.00000. This is consistent 



with the predictions of (39 1 with c — At:, and (42) 



Now let us look at the special case M = 5™, and let / be general for now. Let N = (1, 0, . . . , 0), the 
"north pole". We shall now calculate K t (N,y) as an explicit infinite series. (As we have explained, we 
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will then know K t (x,y) for all x,y.) Recall ([H]) that we may write L 2 (S n ) = ®i >0 Hi, where Hi is the 
space of spherical harmonics of degree I. If P e Hi, then 

AP = i(( + ii- 1)P. 

Within each space Hi is a unique zona? harmonic Zi, which has the property that for all P £ Hi, 
P(N) = (P, Zi). In particular, P is orthogonal to Zi if and only if -P(N) = 0. 

We wish to use (251 to evaluate K t (N,y). To do so we choose an orthonormal basis for each H\, one of 
whose elements is Zi/\\Zi\\2- Then any other element of this orthonormal basis vanishes at N, so we find 



K t (N, y) = J2 ftflQ + n- l))Zi(W)Zi(y) /\\Z, 
But surely Zi(N) = (Zi, Zi), so we simply have 

oo 

K t (N,y) = Y / f(t 2 l(l + n-l))Z l (y) 



ml 



(47) 



1=0 



However, Zi(y) is known explicitly. In fact ([42]), if oj n is the area of S n , then for some constant q, 

Z l {y) = qP^fe), (48) 

where y — (y±, . . . ,y n ), A = (n — l)/2, and P ; A is the ultraspherical (or Gegenbauer) polynomial of degree 
I associated with A. The P A may be defined in terms of the generating function 



(1 - 2rr + r 2 y x = ^ P ; A (r)r 



In particular, if t = 1, we see that 



(l-r)-<»^ = £i?(l> 



1=0 



so that 



(49) 



On the other hand, 



Z,(N) = KJ-'dimW, = K]- 1 [("+') - (" + /~ 2 )] = di 



Comparing (48 1,(50 1 and (511, we see that 



ci = di/bi 



(n+l 



_ n + 2l-l 

n ( n T 2 ) ' ~u n (n-iy 



From (47 1, we find 



K t (N,y) = Y / {n + 21 il /^ + n-l))^) := h t ( Vl ). 
t=t 



(50) 
(51) 

(52) 
(53) 
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If x € S n , we can choose a rotation T with Tx = N. If also y G 5", then by (37 1 



# t (z,y) = K t (Tx, Ty) = K t (N,Ty) = ftt((Ty)i) = /i { (N • Ty) = h t (Tz • Ty) = h t {x ■ y). 
Thus, if Fe L 2 (S n ), 

[f(t 2 A)F](x) = ( F(y)h t (x-y)dfi(y), 



S" 



the spherical convolution of F and the axisymmetric function fit- 
Let us now take f(s) = se~ s . From (53), we find 



W = = £ ^ + ^-l)(,+ 2/-l) t2e _^ +n _ 1) 

f— ' oj n (n — 1) 

2=0 v ; 



(54) 



One can use (54) to draw the graph of h t for any t > 0. We do so, in the most practical situation, n = 2. 
We can go all around a great circle by using spherical coordinates, in which yi = cos(6*), with 9 going 
from — 7r to 7r. We draw these graphs when t = 1 (Figure 2, left), t = .1 (Figure 2, middle) and t = .05 



(Figure 2, right). Actually, since (42 1 predicts 

47r/i t (l) = 4vr J fs: t (N,N) 



V* 2 



we draw the graphs of 4tt h t (cos(8)) instead, with 9 going from — ir to ir on the horizontal axis. The 
pictures do bear out the relation 47r/it(l) ~ 1/t 2 . We also see the characteristic "Mexican hat" shape, 
familiar from the analogous situation on R 1 , where K t (x,y) is the Schwartz kernel of f(—t 2 d 2 /dx 2 ). 
Of course, in that situation, [f{-t 2 d 2 /dx 2 )F] = f(t 2 £ 2 )F(£) = i 2 £ 2 e-* 2 « 2 F(£) for F € 5(M), so 



ift(x, y) = t lr tf}((x — y)/t), where ip is the second derivative of a Gaussian. 
The series (54 1 converges quickly for t large, but not if t is small. Therefore, as in the case of the torus, for 



computational purposes, it is important to have a quickly converging alternate expression for this function 
for small t. Since the pictures indicate that 47r/i t (cos 9) is negligible outside a small neighborhood of 9 = 
if t is small, one would assume that it is only necessary to compute the Maclaurin series of h t . It is very 
fortunate that any number of terms of this series can be computed explicitly, , through use of the work of 
Polterovich [SS], together with some additional insights. 

In fact, what Polterovich found in [38j was the heat trace asymptotics on the sphere, i.e. explicit formulae 
for the a m of (40 1 for the manifold S n , as sums of explicit finite series, for m > 1. (Earlier, less explicit 
formulae were found earlier in ([!]) and ([5]).) Using Polterovich's formula to evaluate some of these a m 
on S 2 , (it is easiest to use Maple), we find that 



tr(e- sA ) - - + - + 



s 

15 



315 315 1 ; ' 



(55) 



there would be no difficulty in evaluating more terms. (Recall that a is given by (41 ), for general M.) 



Using this formula we are going to evaluate the first few terms of the Maclaurin series of 47r/it(cos 9), and 
we will show how any number of terms could be obtained. 

Let Jt(x,y) be the kernel of e _t A on S 2 , so that, in particular, by (38 1, J t (x,x) = tr(e~* A )/4-7r. From 
(53 1, with f(r) = e~ r , we find that if s = t 2 , then 



X 

E 

(=0 



(2Z + 1) 
1 

47T 



-^P^yi) := 9t{Vi). 



(56) 



27 




28 



and similarly, from ( 54 1 , 



S z — ' 47T S 



(57) 



We would like to understand the Maclaurin series of 47rg t (cos (9) and 4-7r/i t (cos 9) . To this end, we shall 
use the following lemma. 



Lemma 6.1. Suppose U S C (S n ) is a function of x± only, U(x) = u(x\), say, and that u is in fact C 
in a neighborhood of [—1, 1]. Then AU is also a function of X\, and 

(AU)(x) = nxxu'(xi) - (1 - x\)u"(xi) 

In particular, if n = 1, one has 

d 2 



(58) 



d6 



2L u(cos0)] = (cos 9)u' (cos 9) - (sin 2 6»)m"(cos6»). 



(59) 



Proof. By the results of [H], A = — V <fc Wf k , where 



W jk = x. 



d_ 
dxk 



Xk 



dxj 



if 1 < j, k < n+ 1. Since U is a function of x\ only we compute 

n+l 



AU 



k=l 
n+l 



-J2[-xiu' + x 2 jU "] 
i=2 



which proves (58), since on the unit sphere, ? x 2 - — 1 — x 2 . Of course (59 1 is elementary, but we note 

that it can also be viewed as a special case of ( |58[ ), since, in polar coordinates, the spherical Laplacian 

i ri 2 — 

on S is just —-jffi- This completes the proof. □ 



In particular, if we set x\ = 1 in (58 1, and 6 = in (59 1, we find that 

-^[«(cob d)]^ = i(AC0(N), 

since both sides equal u'(l). 



(60) 



Remark 6.2. We shall show below, in Lemma 6.3 that if u is C 2m on an open interval containing [—1, 1], 
one can similarly obtain ^ g2m [u(cos 6)] \g = o from a knowledge of (A l U) (N) for i = 1, . . . , m. Thus, if u is 
C°° on an open interval containing [—1, 1], the entire Maclaurin series ofu(cos9) (regarded as a function 
of 9) is completely determined from a knowledge o/(A J [/)(N) fori > 0. (Of course, u(cos#) is an even 
function of 9, so all of its odd-order derivatives vanish at 0.) 
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For now, let us apply this lemma to u — g t as in (56 1 or ~ht as in (57 1. To do so, we first explain why 
gt and h t have C°° extensions to an open neighborhood of [—1,1]. For this, it is evidently sufficient to 
show that for every n > 1 and every m > 0, there exists c„ jm with 



d ' m p("-l)/2/ s 



n+2m— 1 



From the generating function formula ( 49 1, we see the classical formula that the derivative of P t x is 



2XP l Z\. Thus we may assume m = 0. By (481 and (521, we need only show that, on S n , \Z t (y)\ < Cl n , 
and for this, by (51 1, we need only show that \Zi(y)\ < Zi(N) for all y G S n . But we may choose an 

Zi(N), 



\zi\\l 



orthogonal transformation T on W l with TN = y, and then 

Z l (y) = Z ; o T(N) = (Z, o T, Z,) < \\Z t o T|| 2 ||Z,|| 2 
as claimed. 

Let us return to S* 2 . Suppose as usual that t > 0, and again put s = t 2 . By (60 1 and (56 1 we have that 

d 2 



de 2 



47r[3t(cos6»)]| e= o 



4/T 



(A y J^)(N,N) 



^( N ' N ) 



47T d 

Yds' 



Thus, the first few terms of the Maclaurin series of 47r[gt(cos 8)] are 



47r 5i (cos0) ~ tr(e- sA ) + — ^-tr(e- sA ). 

4 ds 



Similarly 



d 2 1 
-— 47r[-/i t (cos0)]| fl=o 



^(A V ^)(N,N) 



4tt d 1 
is s 



1 d : 



Thus, the first few terms of the Maclaurin series of 47r[g t (cos 8)] and 47r[/i t (cos( 

4*g t {<xx0) ~ tr(e" sA ) + ^tr( e - sA ). 

4 ds 



arc 



47r/it(cos / 



4 S ds 2 1 j - 



(61) 
(62) 



In order to put these into a useful form, we now invoke a result of Kannai [27 . 

Set z(8) = (cosd, sin6, Q, . . . ,0) S S" 1 . Since g t (cos8) equals J t (N,z(0)), where J t (x,y) is the kernel of 
the heat operator e~ sA , Kannai's results imply that 



4:ng t (cos( 



-e 2 /4s 



£>(0) f 



(63) 



3=0 
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at least for < \9\ < ir, and where the Vj are smooth functions. (Kannai's result for general M, proved 
through use of the Hadamard parametrix for the wave equation, is that the kernel of e~ sA has the form 



1 



(4 7rs )n/2 



- d (*.f) 2 / 4 * i yv j {x,y)s j 



/ j 3 ' 
3=0 



for x sufficiently close to y, where the Vj are smooth. 

In our case the geodesic distance from N to z{ff) is just and we have set Vj(6) — Vj(N, z(8)).) 
From (63 1 and the fact that (by ([56]) and (|57|), one has 



8 
ds 



5t(cos 9) 



1 



h t (cos6), 



(64) 



we are now motivated to find the first few terms in the Maclaurin series (in 0) of e 82 / 4 M7Tflr t (cos0) and 
e e / 4s 47r/it(cos 9). For this, we need only multiply the right sides of (61 1 and (62 1 by 



2 /4s 



-(s + 9 2 /A+---). 



This yields 



and 



J 2 /As 



Air g t (cos ( 



[str(e- sA ) + —[a— tr(e~ sA ) + tr( £ - sZX )] 



2 / 4s 47r/i t (cos0) 



4 1 ds 



•s- — tr(e" sZX ) + s— tr(e 



d_ 
ds 



-sA\ 



(65) 



where the error, for any fixed s, is 0(9 4 ). Combining this with Polterovich's result (55 1, and putting 
s = t 2 again, we obtain the approximations 



47rg t (cos 9) 



-e 2 /4s 



-[(1 



s 

15 



4s 3 
315 



s 4 9 2 1 2s 
315^ + 4"^3 + 15 



4s2 
105 



4s 3 
315 



)] 



(66) 



It would take considerably more analysis to estimate the error here, but Maple says that when t — .1, the 
error is never more than 6 x 10 -4 for any 9 E [— ir, tt], even though both sides have a maximum of about 
100. Maple says that the greatest error occurs at around 9 = .3, where both sides are about 10.655. 
Similarly one could use ( 65 1 to give an approximation to Anh t (cos 9) . But Maple says that the errors 
are smaller if we differentiate formally with respect to s in (66 1 (and then multiply by — s); here we are 
recalling (64 1. If we do this and finally replace shy t 2 , this yields the approximation 

47rft, t (cos ( 



where 



1 



and 



p(t,6) 

q(t,6) 

Maple says that when t 



t 2 



e 


-0 2 /4t 2 






t 2 


[(1- 




it 6 


t 8 


15 + 


315 


315 


2t 2 


4t 4 


At 6 


15" 


" 105 


+ 315 



-it 2 



)p(t,9)-t 2 q(t,9)} 1 



(67) 



3 2 , 1 2t 2 



4 v 3 



15 



+ 4 ( l5 + 



8^ 
105 



4i^ 
105 H 

At 4 



105 



4r 

315 



This approximation differs from the one obtained from (651 only in terms which are fourth order in 9. 

.1, the error in the approximation (67) is never more than 9.5 x 10 -4 for any 
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9 6 [— 7r, 7r], even though both sides have a maximum of about 100. Maple says that the greatest error 
occurs at around 9 = .4, where both sides are about -5.593. Of course, if in (67 1 we approximate p ~ 1 
and q ~ 0, we would obtain the formula for the usual Mexican hat wavelet on the real line, as a function 
of 9. 

Let us then explain how one can readily obtain any number of terms of the Maclaurin series of gt and h t . 

Lemma 6.3. For any positive integer m, there are constants a\, . . . ,a m as follows. If, in the situation 
of Lemma 6.1 u is C 2m on an open interval containing [— 1, 1], then 



j2m 



d9 2r 



1 (cos0)]| 0=o = ^a l (A J C/)(N). 



Moreover, a m ^ 0. 



Proof. It is enough to show that, generalizing (58 1, there are polynomials pi, . . . ,P2m in x\ such that 

m 2rn 

(& m U)(x)=Y J V l (xi)u^(x 1 )+ (l-s?) <_m Pi(a!i)u W (a:i) (68) 

i—1 i— ra+1 

where p m (l) ^ 0. For then the mapping 

K(l),--- ,J m )(l))^((Af/)(N),... ,(A m f/)(N)) 

will be given by an invertible upper triangular matrix. In particular this is true if n = 1, so that the 
mapping 

.« (m) (l)) - f^a[«(co6fl)]| fl=0) ... ,^[tt(cos6l)]| fl= o 
is also given by an invertible upper triangular matrix. Thus the map 



((ACO(N),--- ,(A m C/)(N)) 



[u(cos ( 



72 m 



0=0 1 ' ' ■ 



cf6» 2 



[w(cos ( 



0=0 



is also giv en b y an invertible upper tri ang ular matrix, which is the desired result. 

To prove (68 1, we recall first that by (58 1, (AU)(x) = Du(x\), where D = nx\-^ — (1 — From 
this, (68l, save for the statement that p m (l) ^ 0, follows at once by a simple induction on m. If p m (l) 
were zero, then the mapping 



(«'(!),. •■ ,u (m) (l)) ^((Af/)(N),-.. ,(A m C/)(N)) 



would be given by a singular upper triangular matrix, so its range would not be all of 1 
is all of R m , since it contains all the vectors 



But the range 



((AZ,)(N), . . . , (A m Z t )(N)) = Z,(N)i(i + n - 1)(1, l(i + n - 1), . . . , [1(1 + n- l)]™" 1 ) , 

for I — 1, . . . , m, and since the rows of a Vandermonde matrix are linearly independent. This contradiction 
completes the proof. □ 

To obtain further terms of the Maclaur in s cries of gt or h t , one need only use Lcmma |6.3| togcthcr with the 
fact that applying A z to the series in (56 1 or (57 1 is the same as applying (—d/ds) 1 . Then Polterovich's 
formula for the heat trace asymptotics may be used. 
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Remark 6.4. Throughout this discussion, we have been considering (|£5j) for f(s) — se~ s . If one instead 
used fm(s) = s m e~ s , where m is a nonnegative integer, one could still obtain explicit formulas, similar 
to (61), giving "generalized" Mexican hat wavelets on the spher e.) Indeed, if h™ corresponds to f m in the 
same way that h t corresponds to f = fi, we have in place of (64) that J^g t (cos9) = ^2 /i™(cosfl). 
Using f m in place of f has certain advantages. In particular, considering functions (of A) which vanish 
more quickly at (such as the f m ) is very important in the characterization of Besov spaces in 1 19f . 



7 Appendix: A Technical Lemma 

In the proof of Lemma [4. 1| we used the following fact. 

Lemma 7.1. Suppose clq, . . . , G C. Then there exists an even function u G <S(K), such that u^ 2l \0) = 
oj, whenever < I < L, and such that suppu C (— 1, 1). (Of course, all odd order derivatives of u vanish 

at o.; 

Proof. We construct h — u. We need to show that, given numbers bo, . . . , &l, there exists h G C£°(R), h 
even, supp h C (—1, 1), with £ 2l h(i;)d£ = bi for < I < L. Select an even function n G C£°(R), with 
supp?7 C (—1, 1), and / rj — 1/2. Select L+l different numbers r\, . . . , r^+i G (0, 1/2). For < t < 1, con- 
sider the even function r?t,m(£) = t~ 1 [ r l(( x +' r m)/t)+T)((x— r m )/t)]. We claim that if t is sufficiently small, 
we may construct our h by setting h — X^mtA c mVt,m for suitable c\, . . . cl+i- Indeed, as t — + + , r)t,m be- 
comes increasingly concentrated near r m and near — r m , so that J C 2 'Em=i c mVt,m]d^ — > X)m=i c m{ r m) 21 - 
The existence of suitable ci, . . . cl+i now follows from the nonvanishing of small perturbations of the Van- 
dermonde determinant. □ 
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